Goto

Collaborating Authors

 Alameda


Former Neuralink Exec Launches Organ Preservation Effort

WIRED

Science Corporation, founded by former Neuralink president Max Hodak, has unveiled a prototype machine to extend the life of organs for longer periods. Science Corporation, the brain-computer interface startup founded in 2021 by former Neuralink president Max Hodak, is launching a new division of the company with the goal of extending the life of human organs. Alameda, California-based Science is aiming to improve on current perfusion systems that continuously circulate blood through vital organs when they can no longer function on their own. The technology is used to preserve organs for transplant and as a life-support measure for patients when the heart and lungs stop working, but it's clunky and costly. Science wants to make a smaller, more portable system that could provide long-term support.


This retina implant lets people with vision loss do a crossword puzzle

MIT Technology Review

Competition to deploy commercial brain-computer interfaces is heating up. A microelectronic chip placed under the retina can produce vision. Science Corporation--a competitor to Neuralink founded by the former president of Elon Musk's brain-interface venture--has leapfrogged its rival after acquiring, at a fire-sale price, a vision implant that's in advanced testing,. The implant produces a form of "artificial vision" that lets some patients read text and do crosswords, according to a report published in the today . The implant is a microelectronic chip placed under the retina. Using signals from a camera mounted on a pair of glasses, the chip emits bursts of electricity in order to bypass photoreceptor cells damaged by macular degeneration, the leading cause of vision loss in elderly people.


RAGferee: Building Contextual Reward Models for Retrieval-Augmented Generation

Coman, Andrei C., Sorodoc, Ionut-Teodor, Ribeiro, Leonardo F. R., Byrne, Bill, Henderson, James, de Gispert, Adrià

arXiv.org Artificial Intelligence

Existing Reward Models (RMs), typically trained on general preference data, struggle in Retrieval Augmented Generation (RAG) settings, which require judging responses for faithfulness to retrieved context, relevance to the user query, appropriate refusals when context is insufficient, completeness and conciseness of information. To address the lack of publicly available RAG-centric preference datasets and specialised RMs, we introduce RAGferee, a methodology that repurposes question-answering (QA) datasets into preference pairs that prioritise groundedness over stylistic features, enabling the training of contextual RMs better suited to judging RAG responses. Using RAGferee, we curate a small preference dataset of 4K samples and fine-tune RMs ranging from 7B to 24B parameters. Our RAG-centric RMs achieve state-of-the-art performance on ContextualJudgeBench, surpassing existing 70B+ RMs trained on much larger (up to 2.4M samples) general corpora, with an absolute improvement of +15.5%.


Cognitive Agents Powered by Large Language Models for Agile Software Project Management

Cinkusz, Konrad, Chudziak, Jarosław A., Niewiadomska-Szynkiewicz, Ewa

arXiv.org Artificial Intelligence

This paper investigates the integration of cognitive agents powered by Large Language Models (LLMs) within the Scaled Agile Framework (SAFe) to reinforce software project management. By deploying virtual agents in simulated software environments, this study explores their potential to fulfill fundamental roles in IT project development, thereby optimizing project outcomes through intelligent automation. Particular emphasis is placed on the adaptability of these agents to Agile methodologies and their transformative impact on decision-making, problem-solving, and collaboration dynamics. The research leverages the CogniSim ecosystem, a platform designed to simulate real-world software engineering challenges, such as aligning technical capabilities with business objectives, managing interdependencies, and maintaining project agility. Through iterative simulations, cognitive agents demonstrate advanced capabilities in task delegation, inter-agent communication, and project lifecycle management. By employing natural language processing to facilitate meaningful dialogues, these agents emulate human roles and improve the efficiency and precision of Agile practices. Key findings from this investigation highlight the ability of LLM-powered cognitive agents to deliver measurable improvements in various metrics, including task completion times, quality of deliverables, and communication coherence. These agents exhibit scalability and adaptability, ensuring their applicability across diverse and complex project environments. This study underscores the potential of integrating LLM-powered agents into Agile project management frameworks as a means of advancing software engineering practices. This integration not only refines the execution of project management tasks but also sets the stage for a paradigm shift in how teams collaborate and address emerging challenges.


Navigating Text-to-Image Generative Bias across Indic Languages

Mittal, Surbhi, Sudan, Arnav, Vatsa, Mayank, Singh, Richa, Glaser, Tamar, Hassner, Tal

arXiv.org Artificial Intelligence

This research investigates biases in text-to-image (TTI) models for the Indic languages widely spoken across India. It evaluates and compares the generative performance and cultural relevance of leading TTI models in these languages against their performance in English. Using the proposed IndicTTI benchmark, we comprehensively assess the performance of 30 Indic languages with two open-source diffusion models and two commercial generation APIs. The primary objective of this benchmark is to evaluate the support for Indic languages in these models and identify areas needing improvement. Given the linguistic diversity of 30 languages spoken by over 1.4 billion people, this benchmark aims to provide a detailed and insightful analysis of TTI models' effectiveness within the Indic linguistic landscape.


A Multi-Agent Reinforcement Learning Framework for Evaluating the U.S. Ending the HIV Epidemic Plan

Sharma, Dinesh, Shah, Ankit, Gopalappa, Chaitra

arXiv.org Artificial Intelligence

Human immunodeficiency virus (HIV) is a major public health concern in the United States, with about 1.2 million people living with HIV and 35,000 newly infected each year. There are considerable geographical disparities in HIV burden and care access across the U.S. The 2019 Ending the HIV Epidemic (EHE) initiative aims to reduce new infections by 90% by 2030, by improving coverage of diagnoses, treatment, and prevention interventions and prioritizing jurisdictions with high HIV prevalence. Identifying optimal scale-up of intervention combinations will help inform resource allocation. Existing HIV decision analytic models either evaluate specific cities or the overall national population, thus overlooking jurisdictional interactions or differences. In this paper, we propose a multi-agent reinforcement learning (MARL) model, that enables jurisdiction-specific decision analyses but in an environment with cross-jurisdictional epidemiological interactions. In experimental analyses, conducted on jurisdictions within California and Florida, optimal policies from MARL were significantly different than those generated from single-agent RL, highlighting the influence of jurisdictional variations and interactions. By using comprehensive modeling of HIV and formulations of state space, action space, and reward functions, this work helps demonstrate the strengths and applicability of MARL for informing public health policies, and provides a framework for expanding to the national-level to inform the EHE.


MeetingBank: A Benchmark Dataset for Meeting Summarization

Hu, Yebowen, Ganter, Tim, Deilamsalehy, Hanieh, Dernoncourt, Franck, Foroosh, Hassan, Liu, Fei

arXiv.org Artificial Intelligence

As the number of recorded meetings increases, it becomes increasingly important to utilize summarization technology to create useful summaries of these recordings. However, there is a crucial lack of annotated meeting corpora for developing this technology, as it can be hard to collect meetings, especially when the topics discussed are confidential. Furthermore, meeting summaries written by experienced writers are scarce, making it hard for abstractive summarizers to produce sensible output without a reliable reference. This lack of annotated corpora has hindered the development of meeting summarization technology. In this paper, we present MeetingBank, a new benchmark dataset of city council meetings over the past decade. MeetingBank is unique among other meeting corpora due to its divide-and-conquer approach, which involves dividing professionally written meeting minutes into shorter passages and aligning them with specific segments of the meeting. This breaks down the process of summarizing a lengthy meeting into smaller, more manageable tasks. The dataset provides a new testbed of various meeting summarization systems and also allows the public to gain insight into how council decisions are made. We make the collection, including meeting video links, transcripts, reference summaries, agenda, and other metadata, publicly available to facilitate the development of better meeting summarization techniques. Our dataset can be accessed at: https://meetingbank.github.io


Clustering US Counties to Find Patterns Related to the COVID-19 Pandemic

Brown, Cora, Milstein, Sarah, Sun, Tianyi, Zhao, Cooper

arXiv.org Artificial Intelligence

When COVID-19 first started spreading and quarantine was implemented, the Society for Industrial and Applied Mathematics (SIAM) Student Chapter at the University of Minnesota-Twin Cities began a collaboration with Ecolab to use our skills as data scientists and mathematicians to extract useful insights from relevant data relating to the pandemic. This collaboration consisted of multiple groups working on different projects. In this write-up we focus on using clustering techniques to help us find groups of similar counties in the US and use that to help us understand the pandemic. Our team for this project consisted of University of Minnesota students Cora Brown, Sarah Milstein, Tianyi Sun, and Cooper Zhao, with help from Ecolab Data Scientist Jimmy Broomfield and University of Minnesota student Skye Ke. In the sections below we describe all of the work done for this project. In Section 2, we list the data we gathered, as well as the feature engineering we performed. In Section 3, we describe the metrics we used for evaluating our models. In Section 4, we explain the methods we used for interpreting the results of our various clustering approaches. In Section 5, we describe the different clustering methods we implemented. In Section 6, we present the results of our clustering techniques and provide relevant interpretation. Finally, in Section 7, we provide some concluding remarks comparing the different clustering methods.


Startup Funding: September 2022

#artificialintelligence

The onshoring and buildout of dozens of fabs, many costing tens of billions of dollars, is beginning to spill over into other areas that are critical for chip manufacturing. Materials, in particular, which often gets little attention outside of chip manufacturing, witnessed a big spike in September 2022. In fact, seven materials companies covered in this report made up more than a third of the month's total reported investments, with three of the companies garnering more than $200 million. Other investment targets were sputtering equipment and evaporation materials for deposition, high-purity polycrystalline silicon, fluorine-containing electronic gases, and silicon carbide. In the AI hardware arena, numerous startups are focusing on in-memory and near-memory compute, reducing the volume of data that needs to be moved back and forth between memory and processing elements. Novel architectures also are appearing, such as one that uses sparse mathematics.


Automakers Report Nearly 400 Crashes of Cars That Used Driver-Assist Tech

TIME - Tech

Automakers reported nearly 400 crashes over a 10-month period involving vehicles with partially automated driver-assist systems, including 273 with Teslas, according to statistics released Wednesday by U.S. safety regulators. The National Highway Traffic Safety Administration cautioned against using the numbers to compare automakers, saying it didn't weight them by the number of vehicles from each manufacturer that use the systems, or how many miles those vehicles traveled. Automakers reported crashes from July of last year through May 15 under an order from the agency, which is examining such crashes broadly for the first time. "As we gather more data, NHTSA will be able to better identify any emerging risks or trends and learn more about how these technologies are performing in the real world," said Steven Cliff, the agency's administrator. Tesla's crashes happened while vehicles were using Autopilot, "Full Self-Driving," Traffic Aware Cruise Control, or other driver-assist systems that have some control over speed and steering.